SUMMARY


DISTRIBUTIONAL  RESULTS  FOR  RANDOM  FUNCTIONALS 
OF  A  DIRICHLET  PROCESS 



We obtain an expression for the distribution function of the random
variable $\int Z\,dP$, where $P$ is a random distribution function chosen by
Ferguson's (1973) Dirichlet process on $(R, \mathcal{B})$ ($R$ is the real line
and $\mathcal{B}$ is the $\sigma$-field of Borel sets) with parameter $\alpha$,
and $Z$ is a real-valued measurable function defined on $(R, \mathcal{B})$
satisfying $\int |Z|\,d\alpha < \infty$. As a consequence, we show that when
$\alpha$ is symmetric about 0 and $Z$ is an odd function, then the distribution
of $\int Z\,dP$ is symmetric about 0.
DISTRIBUTIONAL RESULTS FOR RANDOM FUNCTIONALS
OF A DIRICHLET PROCESS

by

Robert C. Hannum, Myles Hollander, and Naftali A. Langberg
University of Denver and Florida State University

1. Introduction and summary. Let $P$ be a random probability measure
chosen by Ferguson's (1973) Dirichlet process on $(R, \mathcal{B})$ ($R$ is the
real line and $\mathcal{B}$ is the $\sigma$-field of Borel sets) with parameter
$\alpha$. Let $Z$ be a real-valued measurable function defined on
$(R, \mathcal{B})$ satisfying $\int |Z|\,d\alpha < \infty$. Ferguson's
fundamental paper contains many results including, for the purpose of
developing Bayesian nonparametric estimators of various parameters, results
giving the mean of such random variables (rv's) as $\int Z\,dP$ and
(variance $P$) and the median of the rv (median $P$). Yamato (1977) extends
Ferguson's results by obtaining the moments of various random estimable
parameters. Results pertaining to the distribution functions (df's) of random
functionals are less available, an exception being Ferguson's (1973)
expression for the df of the random $q$th percentile of $P$.

In this note we study the df of $\int Z\,dP$. Our contributions are:

(i) In Theorem 2.5 we establish the equality

$$\Pr\Big\{\int Z\,dP \le x\Big\} = \Pr\{T^x \le 0\}, \tag{1.1}$$

which relates the df of $\int Z\,dP$ to the distributions of rv's $T^x$,
$x \in R$, defined by (2.3) and Theorem 2.3. In Theorem 2.3 we obtain the
characteristic function (cf) of $T^x$.

(ii) The cf of $T^x$ can be used to obtain results concerning the
distribution of $\int Z\,dP$. For example, in Corollary 2.6 we use the cf of
$T^x$ to prove that when $Z$ is odd and $\alpha$ is symmetric about 0, then the
distribution of $\int Z\,dP$ is symmetric about 0. (We also present an
alternative proof of Corollary 2.6 using the gamma process representation (see
Ferguson, 1973; Ferguson and Klass, 1972) of the Dirichlet process.) Thus, for
example, when $\alpha$ is symmetric about 0, the odd moments of the random $P$,
namely $\int \xi^n\,dP(\xi)$, $n = 1, 3, 5, \ldots$, have distributions that
are symmetric about 0.

2. The distribution of $\int Z\,dP$. We assume the reader is familiar with
definitions and basic results concerning Ferguson's Dirichlet process. If
this is not the case, the reader is referred to Ferguson (1973). Throughout
this section we take $P$ to be a Dirichlet process on $(R, \mathcal{B})$ with
parameter $\alpha$, and $Z$ to be a real-valued measurable function defined on
$(R, \mathcal{B})$ satisfying $\int |Z|\,d\alpha < \infty$. We first prove two
lemmas which will be used to show that the rv's $T^x$, $x \in R$, defined by
(2.3) and Theorem 2.3, have continuous df's.

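LEMMA 2.1. Let $r$ be a positive integer and let $X_1, \ldots, X_r$ be
independent and identically distributed (iid) rv's such that
$\sum_{i=1}^{r} X_i$ has a continuous df. Then $X_1$ has a continuous df.

PROOF. Note that for all $y \in R$,

$$[\Pr\{X_1 = y\}]^r \le \Pr\Big\{\sum_{i=1}^{r} X_i = ry\Big\}, \tag{2.1}$$

since the event $\{X_1 = y, \ldots, X_r = y\}$ is contained in
$\{\sum_{i=1}^{r} X_i = ry\}$ and the $X_i$ are iid. The right-hand side of
(2.1) is zero by hypothesis, so $\Pr\{X_1 = y\} = 0$ for every $y$. ||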
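Inequality (2.1) can be checked exactly on a small discrete example. The sketch below is only an illustration: the three-point distribution, the choice r = 3, and the helper name `sum_pmf` are assumptions made here and are not part of the paper. It computes the exact law of the sum by brute-force enumeration and compares both sides of (2.1) at every atom.

```python
from fractions import Fraction
from itertools import product


def sum_pmf(pmf, r):
    """Exact pmf of X_1 + ... + X_r for iid X_i with the given finite pmf."""
    out = {}
    for combo in product(pmf.items(), repeat=r):
        total = sum(value for value, _ in combo)
        prob = Fraction(1)
        for _, p in combo:
            prob *= p
        out[total] = out.get(total, Fraction(0)) + prob
    return out


# Hypothetical three-point distribution, chosen only for illustration.
pmf = {0: Fraction(1, 2), 1: Fraction(1, 3), 3: Fraction(1, 6)}
r = 3
pmf_sum = sum_pmf(pmf, r)

# (2.1): Pr{X_1 = y}^r <= Pr{X_1 + ... + X_r = r*y} for every atom y.
inequality_holds = all(p ** r <= pmf_sum[r * y] for y, p in pmf.items())
print(inequality_holds)
```

Exact rational arithmetic is used so that the comparison is not affected by rounding. In the lemma itself the right-hand side is zero, which forces every atom of $X_1$ to have probability zero.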

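LEMMA 2.2. Let $x \in R$ and let $\alpha$ be a finite measure on
$(R, \mathcal{B})$ such that $\alpha\{\xi: Z(\xi) \ne x\} > 0$, where $Z$ is a
measurable function integrable with respect to $\alpha$. Then there is a
positive integer $r$ such that

$$\int_0^\infty \exp\Big\{-r \int_{-\infty}^{\infty} \ln[1 + t^2\{Z(\xi) - x\}^2]\,d\alpha(\xi)\Big\}\,dt < \infty.$$

PROOF. First note that since $\alpha\{\xi: Z(\xi) \ne x\} > 0$ there exist two
positive integers $j$ and $r$ such that
$r \cdot \alpha\{\xi: |Z(\xi) - x| \ge j^{-1}\} \ge 1$. Let
$B_j = \{\xi: |Z(\xi) - x| \ge j^{-1}\}$. We then note that

$$\int_{-\infty}^{\infty} \ln[1 + t^2\{Z(\xi) - x\}^2]\,d\alpha(\xi) \ge \int_{B_j} \ln[1 + t^2\{Z(\xi) - x\}^2]\,d\alpha(\xi) \ge \alpha(B_j) \cdot \ln[1 + (t/j)^2],$$

and so we have

$$\begin{aligned}
\int_0^\infty \exp\Big\{-r \int_{-\infty}^{\infty} \ln[1 + t^2\{Z(\xi) - x\}^2]\,d\alpha(\xi)\Big\}\,dt
&\le \int_0^\infty \exp\{-r \cdot \alpha(B_j) \cdot \ln[1 + (t/j)^2]\}\,dt \\
&= \int_0^\infty [1 + (t/j)^2]^{-r \cdot \alpha(B_j)}\,dt \\
&\le \int_0^\infty [1 + (t/j)^2]^{-1}\,dt < \infty. \quad ||
\end{aligned}$$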
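The choice of $j$ and $r$ in the proof, and the resulting pointwise domination of the integrand by $[1 + (t/j)^2]^{-1}$, can be traced numerically. The following is a minimal sketch under assumptions made only for illustration: $\alpha$ is a hypothetical finite discrete measure, $Z(\xi) = \xi$, and the grid of $t$ values is arbitrary. It checks the domination on the grid; it does not evaluate the integral.

```python
import math

# Hypothetical discrete measure alpha: atoms xi_i with masses a_i, and Z(xi) = xi.
atoms = [-1.0, 0.5, 2.0]
masses = [0.2, 0.3, 0.1]
x = 0.5  # the atom at 0.5 has Z = x, so alpha{xi: Z(xi) != x} = 0.3 > 0


def choose_j_and_r(atoms, masses, x):
    """Pick positive integers j, r with r * alpha(B_j) >= 1, B_j = {|Z - x| >= 1/j}.

    The loop terminates because some atom with positive mass has Z != x.
    """
    j = 1
    while True:
        mass_bj = sum(a for z, a in zip(atoms, masses) if abs(z - x) >= 1.0 / j)
        if mass_bj > 0:
            return j, math.ceil(1.0 / mass_bj), mass_bj
        j += 1


def integrand(t, r):
    """exp{-r * integral of ln[1 + t^2 (Z - x)^2] d alpha} for the discrete alpha."""
    s = sum(a * math.log1p((t * (z - x)) ** 2) for z, a in zip(atoms, masses))
    return math.exp(-r * s)


j, r, mass_bj = choose_j_and_r(atoms, masses, x)
ts = [0.1 * i for i in range(2001)]
# Small slack (1e-12) guards against rounding at t = 0, where both sides equal 1.
dominated = all(integrand(t, r) <= 1.0 / (1.0 + (t / j) ** 2) + 1e-12 for t in ts)
print(j, r, dominated)
```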


Now we introduce some notation. Let

$$I_{jk} = \begin{cases}
\{\xi: (j - 1)/k < Z(\xi) \le j/k\}, & \text{for } j = -k^2 + 1, \ldots, k^2, \\
\{\xi: Z(\xi) \le -k\}, & \text{for } j = -k^2, \\
\{\xi: Z(\xi) > k\}, & \text{for } j = k^2 + 1.
\end{cases}$$

Further let $W_{jk}$, $j = -k^2, \ldots, k^2 + 1$, be independent gamma rv's
with shape parameters $\alpha(I_{jk})$ respectively, and common scale
parameter 1. Let $x \in R$ and define, for $k = 1, 2, \ldots$,

$$T_k^x = \sum_{j=-k^2}^{-1} \{(j/k) - x\} W_{jk} + \sum_{j=0}^{k^2+1} \{[(j - 1)/k] - x\} W_{jk}. \tag{2.2}$$
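The bookkeeping behind the sets $I_{jk}$ and the multipliers in (2.2) can be made concrete with a short sketch. The function names `cell_index` and `coefficient` are hypothetical and introduced here only for illustration; the code assumes a value $z = Z(\xi)$ is given as a floating-point number, so points lying exactly on a cell boundary may be affected by rounding.

```python
import math


def cell_index(z, k):
    """Index j of the set I_jk that contains a point xi with Z(xi) = z."""
    if z <= -k:
        return -k * k          # I_jk = {Z <= -k}
    if z > k:
        return k * k + 1       # I_jk = {Z > k}
    return math.ceil(z * k)    # (j - 1)/k < z <= j/k


def coefficient(j, k):
    """Multiplier of W_jk in (2.2) before x is subtracted.

    It equals j/k for j <= -1 and (j - 1)/k for j >= 0.
    """
    return j / k if j <= -1 else (j - 1) / k


k = 10
for z in (-100.0, -0.37, 0.37, 100.0):
    j = cell_index(z, k)
    print(z, j, coefficient(j, k))
```

For values of $Z$ between $-k$ and $k$, the multiplier differs from $z$ by at most $1/k$, which is what drives the convergence of the discretised quantities as $k$ grows.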

THEOREM 2.3. Let $x \in R$. Then the sequence $T_k^x$, $k = 1, 2, \ldots$,
converges in law as $k \to \infty$ to a rv $T^x$ whose cf is given by

$$\phi_{T^x}(t) = \exp\Big\{-\int_{-\infty}^{\infty} \ln[1 - it\{Z(\xi) - x\}]\,d\alpha(\xi)\Big\}. \tag{2.3}$$


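PROOF. Let $\delta_\xi(I_{jk}) = 1$ when $\xi \in I_{jk}$ and 0 otherwise, and
for each $\xi \in R$ define

$$Z_k(\xi) = \sum_{j=-k^2}^{-1} [j/k]\,\delta_\xi(I_{jk}) + \sum_{j=0}^{k^2+1} [(j - 1)/k]\,\delta_\xi(I_{jk})$$

for $k = 1, 2, \ldots$. Let $x \in R$ and note from (2.2) that since $T_k^x$ is
a linear combination of independent gamma rv's, it has a cf which is
infinitely divisible. Hence we may write

$$\begin{aligned}
\ln E(\exp\{itT_k^x\})
&= \sum_{j=-k^2}^{-1} \ln E(\exp\{it[(j/k) - x]W_{jk}\}) + \sum_{j=0}^{k^2+1} \ln E(\exp\{it[((j - 1)/k) - x]W_{jk}\}) \\
&= \sum_{j=-k^2}^{-1} \ln[1 - it\{(j/k) - x\}]^{-\alpha(I_{jk})} + \sum_{j=0}^{k^2+1} \ln[1 - it\{[(j - 1)/k] - x\}]^{-\alpha(I_{jk})} \\
&= -\Big\{\sum_{j=-k^2}^{-1} \alpha(I_{jk}) \ln[1 - it\{(j/k) - x\}] + \sum_{j=0}^{k^2+1} \alpha(I_{jk}) \ln[1 - it\{[(j - 1)/k] - x\}]\Big\} \\
&= -\int_{-\infty}^{\infty} \ln[1 - it\{Z_k(\xi) - x\}]\,d\alpha(\xi) \\
&= -\int_{-\infty}^{\infty} \ln[(1 + t^2\{Z_k(\xi) - x\}^2)^{1/2}]\,d\alpha(\xi) + i \int_{-\infty}^{\infty} \tan^{-1}[t\{Z_k(\xi) - x\}]\,d\alpha(\xi).
\end{aligned} \tag{2.4}$$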

Now observe that $|Z_k(\xi)| \le |Z(\xi)|$ for all real $\xi$ and so we have

$$\ln[(1 + t^2\{Z_k(\xi) - x\}^2)^{1/2}] \le |t| \cdot \{|Z_k(\xi)| + |x|\} \le |t| \cdot \{|Z(\xi)| + |x|\} \tag{2.5}$$

and

$$|\tan^{-1}[t\{Z_k(\xi) - x\}]| \le |t| \cdot \{|Z_k(\xi)| + |x|\} \le |t| \cdot \{|Z(\xi)| + |x|\}. \tag{2.6}$$

Since the right-hand sides of (2.5) and (2.6) are integrable with respect to
$\alpha$ and since $\lim_{k \to \infty} Z_k(\xi) = Z(\xi)$, then by the
dominated convergence theorem we have from (2.4) that

$$\lim_{k \to \infty} E(\exp\{itT_k^x\}) = \exp\Big\{-\int_{-\infty}^{\infty} \ln[1 - it\{Z(\xi) - x\}]\,d\alpha(\xi)\Big\}. \tag{2.7}$$

Writing the right-hand side of (2.7) as $\exp\{-[G(t, x) + iH(t, x)]\}$ where

$$G(t, x) = \tfrac{1}{2} \int_{-\infty}^{\infty} \ln[1 + t^2\{Z(\xi) - x\}^2]\,d\alpha(\xi),$$

$$H(t, x) = -\int_{-\infty}^{\infty} \tan^{-1}[t\{Z(\xi) - x\}]\,d\alpha(\xi),$$

it is straightforward to show that for any real number $x$, both
$|G(t, x) - G(0, x)|$ and $|H(t, x) - H(0, x)|$ are bounded above by
$|t| \cdot \int_{-\infty}^{\infty} \{|Z(\xi)| + |x|\}\,d\alpha(\xi)$, which goes
to zero as $t \to 0$. This shows that the right-hand side of (2.7) is
continuous at $t = 0$ and hence the result of the theorem follows from the
continuity theorem for cf's. ||
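When $\alpha$ is a finite sum of point masses, the integral in (2.3) reduces to a finite sum and the limit law is that of a finite linear combination of independent gamma rv's, so the formula can be compared with a simulation. The sketch below makes assumptions that are not in the paper: the atoms, masses, the values of $x$ and $t$, the seed and the sample size are all chosen only for illustration.

```python
import cmath
import random

# Hypothetical discrete alpha: atoms carrying Z-values z_i with masses a_i.
z_values = [-1.0, 0.5, 2.0]
masses = [0.8, 1.5, 0.7]
x, t = 0.25, 0.6


def cf_formula(t, x):
    """Right-hand side of (2.3) when alpha is a finite sum of point masses."""
    total = sum(a * cmath.log(1 - 1j * t * (z - x)) for z, a in zip(z_values, masses))
    return cmath.exp(-total)


def sample_T(rng, x):
    """One draw of sum_i (z_i - x) W_i with independent W_i ~ Gamma(a_i, scale 1)."""
    return sum((z - x) * rng.gammavariate(a, 1.0) for z, a in zip(z_values, masses))


rng = random.Random(0)
n = 50_000
# Monte Carlo estimate of E exp{i t T^x}.
empirical = sum(cmath.exp(1j * t * sample_T(rng, x)) for _ in range(n)) / n
print(abs(empirical - cf_formula(t, x)))
```

The principal branch of the complex logarithm is appropriate here because $1 - it\{z - x\}$ always has real part 1, so its argument stays strictly between $-\pi/2$ and $\pi/2$.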
LEMMA 2.4. Let $x \in R$ and assume that $\alpha\{\xi: Z(\xi) \ne x\} > 0$.
Then $T^x$ has a continuous df.

PROOF. By Lemma 2.2 there is a positive integer $r$ such that
$\int_{-\infty}^{\infty} |[E(\exp\{itT^x\})]^r|\,dt < \infty$. (By (2.3),
$|E(\exp\{itT^x\})|^r = \exp\{-(r/2) \int \ln[1 + t^2\{Z(\xi) - x\}^2]\,d\alpha(\xi)\}$,
which is even in $t$, so it suffices to take $r$ to be twice the integer
supplied by Lemma 2.2.) Now $[E(\exp\{itT^x\})]^r$ is the cf of the
convolution of $r$ iid rv's distributed as $T^x$. Since the cf of this
convolution is absolutely integrable we conclude (cf. Breiman (1968), p. 178)
that the convolution of $r$ iid rv's distributed as $T^x$ is continuous.
Consequently the result follows from Lemma 2.1. ||

We are now ready to obtain an expression for the df of $\int Z\,dP$.

THEOREM 2.5. Let $P$ be a Dirichlet process on $(R, \mathcal{B})$ with
parameter $\alpha$ and let $Z$ denote a real-valued measurable function
defined on $(R, \mathcal{B})$ integrable with respect to $\alpha$.

(i) If there exists a real number $x$ such that
$\alpha\{\xi: Z(\xi) \ne x\} = 0$, then $\int Z\,dP$ is degenerate at $x$.

(ii) If $\alpha\{\xi: Z(\xi) \ne x\} > 0$ for all $x \in R$, then (1.1) holds,
where $T^x$ is the rv whose cf is given by the right-hand side of (2.3).

PROOF. (i) Note that $\alpha\{\xi: Z(\xi) \ne x\} = 0$ implies that
$P\{\xi: Z(\xi) \ne x\} = 0$ with probability one. Thus
$\int_{-\infty}^{\infty} Z(\xi)\,dP(\xi) = x$.

(ii) Let $A_k = \int_{-\infty}^{\infty} Z_k(\xi)\,dP(\xi)$, for
$k = 1, 2, \ldots$, and note that

$$\begin{aligned}
\Pr\{A_k \le x\} &= \Pr\Big\{\int_{-\infty}^{\infty} Z_k(\xi)\,dP(\xi) \le x\Big\} \\
&= \Pr\Big\{\sum_{j=-k^2}^{-1} [j/k]\,P(I_{jk}) + \sum_{j=0}^{k^2+1} [(j - 1)/k]\,P(I_{jk}) \le x\Big\}.
\end{aligned} \tag{2.8}$$

Now for each $k$ the random vector $(P(I_{jk}))$, $j = -k^2, \ldots, k^2 + 1$,
has a Dirichlet distribution with parameter $(\alpha(I_{jk}))$,
$j = -k^2, \ldots, k^2 + 1$. Letting $W_{jk}$, $j = -k^2, \ldots, k^2 + 1$,
represent independent gamma rv's with shape parameters $\alpha(I_{jk})$,
$j = -k^2, \ldots, k^2 + 1$, respectively and common scale parameter 1, and
using the gamma distribution definition of the Dirichlet distribution, (2.8)
becomes

$$\begin{aligned}
\Pr\{A_k \le x\} &= \Pr\Big\{\sum_{j=-k^2}^{-1} [j/k]\,W_{jk} + \sum_{j=0}^{k^2+1} [(j - 1)/k]\,W_{jk} \le x \sum_{j=-k^2}^{k^2+1} W_{jk}\Big\} \\
&= \Pr\Big\{\sum_{j=-k^2}^{-1} \{(j/k) - x\} W_{jk} + \sum_{j=0}^{k^2+1} \{[(j - 1)/k] - x\} W_{jk} \le 0\Big\} \\
&= \Pr\{T_k^x \le 0\}.
\end{aligned} \tag{2.9}$$

Now note that $\lim_{k \to \infty} A_k = \int Z\,dP$ almost everywhere with
respect to $\alpha$ and thus by (2.9) we have

$$\Pr\Big\{\int Z\,dP \le x\Big\} = \lim_{k \to \infty} \Pr\{A_k \le x\} = \lim_{k \to \infty} \Pr\{T_k^x \le 0\} = \Pr\{T^x \le 0\},$$

the last equality following from Theorem 2.3 and Lemma 2.4. ||
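The step from (2.8) to (2.9) can be seen directly in a simulation when $\alpha$ is a finite sum of point masses, since then no limit in $k$ is needed. The sketch below is illustrative only: the atoms, masses, threshold $x$, seed and sample size are assumptions made here. It draws the gamma rv's once, forms the Dirichlet weights by normalising them, and counts how often each of the two events in (1.1) occurs.

```python
import random

# Hypothetical discrete alpha: atoms carrying Z-values z_i with masses a_i.
z_values = [-1.0, 0.5, 2.0]
masses = [0.8, 1.5, 0.7]
x = 0.4

rng = random.Random(1)
n = 20_000
count_mean, count_T = 0, 0
for _ in range(n):
    w = [rng.gammavariate(a, 1.0) for a in masses]
    s = sum(w)
    p = [wi / s for wi in w]  # Dirichlet(masses) weights via normalised gammas
    mean_ZdP = sum(z * pi for z, pi in zip(z_values, p))   # integral of Z dP
    T_x = sum((z - x) * wi for z, wi in zip(z_values, w))  # analogue of T^x
    count_mean += int(mean_ZdP <= x)
    count_T += int(T_x <= 0)

print(count_mean / n, count_T / n)
```

Because the same gamma draws are used for both quantities, the two events coincide draw by draw (up to floating-point rounding), which is exactly the algebraic rearrangement used in (2.9).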

Using Theorem 2.5 in conjunction with the cf of $T^x$, given in (2.3), it
is now possible to gain additional information concerning the distribution of
the random mean $\int Z\,dP$.

COROLLARY 2.6. Let $P$ be a Dirichlet process on $(R, \mathcal{B})$ with
parameter $\alpha$, where $\alpha$ is a positive finite and symmetric measure
about the origin. Let $Z$ denote a real-valued measurable and odd function
such that $\int |Z|\,d\alpha < \infty$. Then $\int Z\,dP$ has a symmetric
distribution about the origin.
PROOF. First assume that there is a real number $x$ such that
$\alpha\{\xi: Z(\xi) \ne x\} = 0$. Then since $Z(\xi) = -Z(-\xi)$ it follows
that $\alpha\{\xi: Z(\xi) \ne 0\} = 0$. Thus by Theorem 2.5 $\int Z\,dP$ is
degenerate at $x = 0$ and hence symmetric.

Next assume that $\alpha\{\xi: Z(\xi) \ne x\} > 0$ for all $x \in R$. By
repeating our arguments it is clear that
$\Pr\{\int Z\,dP \ge -x\} = \Pr\{T^{-x} \ge 0\}$ and thus to show that
$\int Z\,dP$ is symmetric about the origin it suffices to show that, for all
$x \in R$,

$$\Pr\{T^x \le 0\} = \Pr\{-T^{-x} \le 0\}. \tag{2.10}$$

Since $Z(\xi) = -Z(-\xi)$ and since $d\alpha(\xi) = d\alpha(-\xi)$ we have

$$\begin{aligned}
\ln E(\exp\{itT^{-x}\}) &= -\int_{-\infty}^{\infty} \ln[1 - it\{Z(\xi) + x\}]\,d\alpha(\xi) \\
&= -\int_{-\infty}^{\infty} \ln[1 + it\{Z(-\xi) - x\}]\,d\alpha(-\xi) = \ln E(\exp\{-itT^x\}).
\end{aligned} \tag{2.11}$$

From (2.11) we obtain that $T^x$ has the same df as $-T^{-x}$ and hence (2.10)
follows directly. ||
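The symmetry asserted in Corollary 2.6 can be illustrated by simulation for an odd moment of $P$. The following sketch rests on assumptions chosen only for illustration: $\alpha$ is a hypothetical discrete measure placing equal mass at $\xi$ and $-\xi$, $Z(\xi) = \xi^3$, and the threshold, seed and sample size are arbitrary. It compares the empirical frequencies of the two tail events.

```python
import random

# Hypothetical symmetric alpha: equal mass at xi and -xi; Z(xi) = xi**3 is odd.
atoms = [-2.0, -1.0, 1.0, 2.0]
masses = [0.5, 1.0, 1.0, 0.5]


def Z(xi):
    return xi ** 3


def sample_mean(rng):
    """One draw of the integral of Z dP, with Dirichlet weights built from gammas."""
    w = [rng.gammavariate(a, 1.0) for a in masses]
    s = sum(w)
    return sum(Z(xi) * wi for xi, wi in zip(atoms, w)) / s


rng = random.Random(2)
n = 40_000
draws = [sample_mean(rng) for _ in range(n)]

x = 1.5
left = sum(d <= -x for d in draws) / n   # estimate of Pr{integral <= -x}
right = sum(d >= x for d in draws) / n   # estimate of Pr{integral >= x}
print(left, right)
```

The two frequencies should agree up to Monte Carlo error. Agreement at a single threshold is of course only a sanity check and not a substitute for the proof.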

The proof of Corollary 2.6 given above is based on Ferguson's (1973)
original definition of the Dirichlet process. Below we exhibit an alternative
proof of Corollary 2.6 which is based on Ferguson's (1973) (also see Ferguson
and Klass, 1972) alternative definition of the Dirichlet process as the sum of
a countable number of jumps of random height at a countable number of random
points. (This alternative representation is often useful in obtaining results
for the Dirichlet process, cf. Ferguson (1973), Korwar and Hollander (1973),
Yamato (1977).)

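ALTERNATIVE PROOF OF COROLLARY 2.6. We have, by Theorem 2 of Ferguson
(1973),

$$P(A) = \sum_{j=1}^{\infty} P_j\,\delta_{V_j}(A), \quad A \in \mathcal{B},$$

where $\delta_{V_j}(A) = 1$ when $V_j \in A$ and 0 otherwise, where
$V_1, V_2, \ldots$ are iid according to $Q(\cdot) = \alpha(\cdot)/\alpha(R)$,
where the $P_j$'s are independent of the $V_j$'s, and where the $P_j$'s depend
on $\alpha(\cdot)$ only through $\alpha(R)$. Thus we have (since $Z$ is odd)

$$\int Z\,dP = \sum_{j=1}^{\infty} Z(V_j) P_j = -\sum_{j=1}^{\infty} Z(-V_j) P_j. \tag{2.12}$$

Now since $\alpha$ is symmetric about the origin, $-V_j \overset{d}{=} V_j$,
for all $j = 1, 2, \ldots$, where $\overset{d}{=}$ means "has the same
distribution as." Hence, since the $V_j$'s are independent of the $P_j$'s we
have

$$-\sum_{j=1}^{\infty} Z(-V_j) P_j \overset{d}{=} -\sum_{j=1}^{\infty} Z(V_j) P_j = -\int Z\,dP. \tag{2.13}$$

From (2.12) and (2.13) we conclude that
$\int Z\,dP \overset{d}{=} -\int Z\,dP$ and so the df of $\int Z\,dP$ is
symmetric about the origin. ||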